Hellary Zhang
7/15/2022

README for processing GIS data to match student geocodes to Census Block Groups.

SOFTWARE:


INPUTS:
    - geo centroids.shp - a map of Boston's geocodes obtained from BPS
    - tl_2010_25_bg00.shp - Tiger/Line shapefiles of the 2000 Census Block Groups obtained from
      https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.2000.html
    - R13124478.txt, which is converted to "census_2000.dta" - 2000 Census Block Group data obtained from Social Explorer

STEPS:
	1) Use QGIS to intersect "geo centroids.shp" and "tl_2010_25_bg00.shp." Use "T1_2000_intersect.qgz" to replicate.
	   This will output the file geo_census_intersect_2000_novars.csv, which creates a mapping of Boston geocodes to 2000
	   Census Block Groups. To output this file, right click on "Intersection", click on "Export," go to "Save Features As."
		 Select where you want to save the .csv and name it "geo_census_intersect_2000_novars.csv."
		 		- For this replication package, it is saved in the filepath denoted by global $raw_data_census
				  (see set_paths.ado for full filepath)
	2) Later, when you run Master.do, one section of a_gen_prelim_data.do will run the STATA do-file "create_census_2000_bg.do"
	   which merges geo_census_intersect_2000_novars.csv with census_2000.dta, which includes all the 2000 Census Block Group
		 characteristics for Table 1 Balance checks.

INSTALL: https://www.qgis.org/en/site/
    - The geocodes in the paper were computed using version 3.22.6 'Białowieża'
